Monte Carlo Noisy HMM Estimation and Segmental Differential Features on the Aurora2 Clean Training Evaluation
نویسندگان
چکیده
In this paper, the compensation of mismatch between clean training hidden Markov models (HMMs) and noisy test speech is addressed. The purpose is to approach the performance of Aurora2 multi-condition training but use only clean training material. The idea is to integrate three methods including (1) mean subtraction, variance normalization and ARMA filtering (MVA) post-processing for Mel-scaled cepstral coefficients (MFCCs) normalization, (2) Monte Carlo noisy HMM estimation by adding artificial noises in the linear mel-scale filterbank parameter (MELSPEC) domain and (3) novel segmental differential features for increasing recognizer’s discriminative power. Experimental results on Aurora2 clean training corpus have shown that great performance improvement was achieved. Especially, although only clean training material was used, the performance did close to the level of Aurora2 multi-condition training.
منابع مشابه
Robust Speech Recognition Using Generalized Distillation Framework
In this paper, we propose a noise robust speech recognition system built using generalized distillation framework. It is assumed that during training, in addition to the training data, some kind of ”privileged” information is available and can be used to guide the training process. This allows to obtain a system which at test time outperforms those built on regular training data alone. In the c...
متن کاملEvaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks
In this paper, we have evaluated a noisy speech recognition method based on noise reduction and acoustic model adaptation, on the AURORA2 tasks. For noise reduction method, we employed two noise reduction methods. One is an Adaptive Sub-Band Spectral Subtraction (ASBSS) method which can vary noise subtraction rate according to SNR in frequency bands at each frame. The other is a Kalman filterin...
متن کاملHMM modelling of additive noise in the western languages context
This paper is concerned to the noisy speech HMM modelling when the noise is additive, speech independent and the spectral analysis is based on subbands. The internal distributions of the noisy speech HMM’s were derived when Gaussian mixture density distributions for clean speech HMM modelling are used, and the noise is normally distributed and additive in the time domain. In these circumstances...
متن کاملRecursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
We describe a novel algorithm for recursive estimation of nonstationary acoustic noise which corrupts clean speech, and a successful application of the algorithm in the speech feature enhancement framework of noise-normalized SPLICE for robust speech recognition. The noise estimation algorithm makes use of a nonlinear model of the acoustic environment in the cepstral domain. Central to the algo...
متن کاملSpeech Enhancement Based on Snr-dependent Empirical Statistical Estimation in Log-spectral Magnitude Domain
We present a data-driven speech enhancement system based on empirical statistical estimations of speech in the log-spectral magnitude domain, where the enhancement filter is trained at each SNR index. We use a estimation method called SNRGMM, which were developed in our previous work, to cluster the training data and learn the enhancement filter at each SNR index. This measurement is later used...
متن کامل